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Abstract. We introduce a stochastic model in which adjacent planar regions A, B 
merge stochastically at some rate X(A, B), and observe analogies with the well-studied 
topics of mean-field coagulation and of bond percolation. Do infinite regions appear 
in finite time? We give a simple condition on A for this hegemony property to hold, 
and another simple condition for it to not hold, but there is a large gap between these 
conditions, which includes the case X(A, B) = 1. For this case, a non-rigorous analytic 
argument and simulations suggest hegemony. 
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1. Introduction 

We study random processes denned as follows. 

(i) at each time t the plane is partitioned into polygonal regions 

(ii) as t increases, adjacent regions A, B merge into one region AU B stochastically at 
some rate (probability per unit time) X(A,B). 

Here X(A, B) is a specified function of the geometry of regions A and B, typically a 
simple formula involving quantities such as the areas area(A), area(5), the perimeters 
(boundary lengths) peri(A), peri(_B) and the length L(A,B) of the boundary between 
A and B. The adjacency condition is enforced by the assumptions 

if L(A, B)>0 then X(A, B) > 0; if L(A, B) = then X(A, B) = 0. 

This is intended as an abstract model for spatial growth via merging - we mentally 
picture countries merging into empires. Despite its conceptual simplicity, this model 
has apparently never been studied, so let us name it the empire process. The purpose 
of this article is to initiate its study by setting out some simple analytic and simulation 
results, describing connections with other models, and providing suggestions for further 
study and methodology. 

1.1. Analogy with mean- field coalescence 

Consider moving particles of different masses x (we take the continuous case < x < oo), 
where particles of masses x and y may coalesce when they meet. Ignoring geometry 
(positions and motion of particles) and simply assuming there is a "relative propensity 
to meet and coalesce" function K(x, y) which applies to all pairs of particles, leads one 
to formulate a "mean-field" model. Write f(x,t)dx for the density per unit volume at 
time t of particles with mass in [x,x + dx]. The density function f(x,t) satisfies the 
well-known Smoluchowski coagulation equation PQ El E] 

rx poo 

f t f(x,t) = l K(y,x-y)f(y,t)f(x-y,t)dy-f(x,t) K(x,y)f(y,t)dy.(l) 
Jo Jo 

In the empire process the conserved quantity is area, so there is an analogous density 

f(a,t) of area-a regions per unit area. But - even when the merger rate function is of 

the form X(A, B) = K(aiea.(A), area(5)) - we cannot write down autonomous equations 

analogous to for J^/(a, t), because spatial relationships, and the stochastic nature of 

mergers, matter in our model. 

Versions (now called stochastic coalescents) with finite total mass and stochastic 

merging but no geometry have been studied [21 H] . And there is recent progress [5] in 

rigorous verification of the underlying presumption that in models of spatial diffusion 

and merging, the limiting (low density of massive particles) behaviour is as predicted 

by the Smoluchowski coagulation equation. Our empire process model is just different, 

but our general theme of relating qualitative properties of the process to properties of 

the rate function is obviously parallel to the same theme in mean-field coalescence. 
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1.2. Analogy with bond percolation 

To each edge e of the square lattice assign a random time T e , with Exponential 1) 
distribution P(T e < t) = 1 — e - ', at which the edge becomes "open". The configuration 
of open edges at time t is just the usual bond percolation [BJ [7] process with p = 
1 — e~*. There is an associated spanning forest process in which is included only those 
edges which (upon becoming open) link two distinct open components. A connected 
component of the bond percolation process with m+1 vertices contains a tree (within the 
spanning forest process) with m edges, and this connected component can be identified, 
as in Figure 1, with a region of area m in the dual square grid. 



Figure 1. The region associated with a component in bond percolation. 

The number of edges in the lattice between two adjacent components of the bond 
percolation process equals the boundary length between the two associated dual regions. 
In the bond percolation process as t increases to t + dt, each closed edge has chance dt 
to become open, so the bond percolation components merge at rate "number of closed 
edges linking the components" , and so in the dual process two regions A, B merge at 
rate L(A, B). So we have shown 

In the empire process with X(A, B) = L(A, B), started at t = with the unit 
squares of the square grid, the areas of regions at time t are distributed as 
the numbers of edges of the connected components of bond percolation with 
p{t) = 1 - e -t . 

The celebrated result (HI [9] that the critical value in bond percolation equals 1/2 implies 
that, in the particular empire process above, infinite regions appear at time log 2. 

2. Hegemonic or not? 



Whether or not the qualitative property 

infinite-area regions appear in finite time (2) 

holds is perhaps the most interesting question to ask about empire processes. In 
percolation theory the analogous property is described by phrases like "supercritical" or 
"percolation occurs" , while in coagulation theory it is called "gelation" , but to maintain 
the visualization of empires let us call (|2]) the hegemonic case. (See Notes below for 
details of interpretation of Can we give conditions, in terms of the merger rate 

function X(A, B), for whether the empire process is hegemonic or non-hegemonic? Here 
is one easy result. 
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Proposition 1 (i) Suppose (for some c> 0) we have 

X(A, B)>c L(A, B) for all A, B. 

Then the empire process is hegemonic, 
(ii) Suppose (for some C < oo) we have 

X(A, Bj)peii(Bj) < C peri(v4) for all A with adjacent regions (Bj). 

j 

Then the empire process is not hegemonic. 

See ([7]) for an explicit example of (ii). 

Before giving the simple proofs, we should point out a subtlety whose resolution 
will be useful. Consider two empire processes, where the second has faster rates than 
the first: 

\i(A,B) < X 2 (A,B) for all A,B. (3) 

The assertion 

if the Ai empire process is hegemonic and ^ holds then 

the A2 empire process is hegemonic (4) 

seems plausible at first sight, but we suspect it is not always true (see Notes below). 
What is true? If we can couple (define simultaneously) the two processes in such a 
way that the natural refinement-coarsening relationship is maintained (each region of 
the first is a subset of a region of the second) then the conclusion "if the first process 
is hegemonic then the second process is also hegemonic" is certainly correct. And the 
natural condition which ensures the relation can be maintained is the following. 

Bj) < X 2 (A, B), for all partitions (Ai) of A and (Bj) of 5.(5) 

This condition includes the case A\ = A, B\ = B and so ^ is a stronger assumption 
than @. 

To investigate when condition ^ might hold, consider a rate function with the 

superadditive property 

Xx(Ai, B)+X 1 (A 2 , B) < Xx(AxUA 2 , B), for all B and disjoint (A x , A 2 ).(6) 

This condition implies that ^ holds with A 2 replaced by Ai. So if we assume both 
(J3j) and superadditivity of Ai then we have ^ and the desired implication "if the first 
process is hegemonic then the second process is also hegemonic" is correct. 

Because the rate function L(A, B) is superadditive (in fact, additive) we can 
immediately deduce part (i) of the Proposition (the constant c is just time-scaling) 
from the bond-percolation fact. 

To prove (ii), fix an arbitrary reference point in the plane and consider the perimeter 
(X t , say) of the region lZ t containing the reference point at time t. The definition of the 
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empire process gives growth dynamics as follows, where (Bi) are the regions adjacent 
to lZ t , and T t denotes the history of the entire process up to time t. 

E{dX t \F t ) = ^(peri(ft t U B,) - peri(ft t ))A(7^, B t ) dt 

< ^peri( J B i )A(7^, J B i ) dt 

i 

< C peri(7^) dt by assumption (ii) 
= CX t dt. 

So jjj_EX t < C EX t and so -EX t < e ct EX§. So regions have finite mean size at all 
times. 



An explicit example where (ii) applies is 

L(A.B) 
max(peri(A), peri(£>)) 



HA,B)= , L(A : B) „ p „ . (7) 



Because in this case 

^2 Bj)peTi(Bj) < Y^HABj) = peri(A). 

3 3 

Any merger rate A' which is slower than this A in sense ^ will also be non-hegemonic, 
because case (ii) will apply. But, to repeat the warning at Q, in general if a merger 
rate is slower in sense (|3j) than a known non-hegemonic merger rate, then we cannot 
immediately deduce that it is also non-hegemonic. 

2.1. Notes on section^ 

Details of the definition of empire process. It is natural to assume the initial 
configuration is statistically translation-invariant, so that it will remain so at all times. 
In the section [4] simulations we started with the square grid for simplicity, though for 
certain theoretical analyses (section [3]) it is more convenient to use the hexagonal lattice, 
and as a continuum model it is perhaps conceptually more natural to start with a non- 
lattice configuration such as the Voronoi tessellation on random points. 

Details of the definition of hegemonic. Property (|2]) is imprecise in two ways, which 
we mention here without seeking a more precise analysis. First, we are presuming that 
the choice of initial configuration does not affect (|2]). Second, we are presuming that 
(J2j) has probability zero or one. Both presumptions are plausible by comparison with 
standard percolation results. 



Limitations of the coupling methodology. One might hope that the coupling 
methodology used to prove (i) might be useful more widely to prove hegemony under 
different assumptions. But, by considering small squares on either side of a boundary 
line, one sees that any superadditive rate function A must satisfy the hypothesis of (i). 
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Since we need superadditivity to use the coupling, one cannot handle any essentially 
different cases this way. 

Limitations of the methodology for (ii). The result in (ii) remains true if we replace 
"perimeter" by "area" or indeed any subadditive functional, but we don't know any 
interesting cases that can be handled by this variant and not by the stated form of (ii). 

Why might the implication fail? This very sketchy outline indicates why (|4]) is not 
obvious. Consider a hegemonic empire process with rate Ai, starting with the square 
grid. Write lZ t for the region containing a reference point. Take a sequence m^mj, . . . 
such that (with probability one) only finitely many values rrij are seen as values of 
area(7?-t). Then we should be able to modify Ai by redefining Xi(A, B) to be small 
whenever area(A) or area(5) equals some rrij, preserving the hegemonic property and 
the "only finitely many values rrij are seen" property. Now take A2 as the modification 
of Ai in which rates \2{A, B) are made very large when area(v4) + area(5) = some rrij. 
This modification should force the sequence aiea,{lZ t ) (for the A2 empire process) to 
re-enter the sequence (rrij) infinitely often and (having slow merger rates therein) to be 
non-hegemonic . 

3. Analysis of the case A = 1 

In this section we present a non-rigorous, though convincing, analytic argument that the 
constant rate case (X(A,B) = 1 for adjacent A, B) is hegemonic, and this is supported 
by simulations - see section |4} 

Note that a special feature of this case is that any initial boundary line will still 
be a piece of some boundary line at time t with probability exactly e~*. However, since 
these events for different initial lines are dependent, it is not clear how to exploit this 
formula directly. 

It is convenient (the reason is explained below) to take the initial configuration 
to be the hexagonal lattice, scaled so that boundary edges have length 1. Consider a 
circuit C which encloses a reference point b. Write !Z t for the boundary of the region of 
the empire process containing b at time t. As with the standard Peierls contour method 
P Ej in percolation, to prove hegemonic it is enough to prove that 



where the sum is over all circuits enclosing b. The length of a circuit must be even and 
> 6: write In for the length. The number of possible circuits enclosing b of length 2n 
can be bounded as order 2 2n without taking into account their self-avoiding property; 
but taking this into account reduces the bound to order 2^ 2 ~ 6 ^ n for some 5 > (see JTU] 
for discussion of the value of S, which is not important for our calculation). So if we can 




C 
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derive an upper bound P2n(t) for the probability that any particular contour of length 
2n is the boundary of TZ t , then it is enough to show 



To obtain such an upper bound P2n{t) consider Figure 2, which shows part of a 
circuit C which is present at time t, and shows the regions whose boundaries include 
part of C: shown are interior regions A,B,C,D and exterior regions W, X,Y, Z. 



Figure 2. Part of a contour C enclosing a point b, illustrating regions adjacent to the 
contour. 

This picture can change in two ways at time increases. 

(i) Two regions that are adjacent along the circuit - either exterior regions such as X, Y 
or interior regions such as B, C - may merge into one. This preserves the circuit and 
(typically) decreases the number of regions adjacent to the circuit by 1. 

(ii) An interior and exterior region - such as B, X - may merge, destroying the circuit. 
Note this is where it is convenient to start with the hexagonal lattice - the same position 
on C cannot separate two exterior regions and two interior regions, so with i exterior 
and j interior regions, the number of adjacent interior-exterior region pairs equals i + j, 
provided neither i nor j equals 1. 

We continue the analysis assuming the "typical" behaviour above always occurs - see 
Discussion below. Because each possible merger occurs at rate 1, the process (I t , J t ) 
where 

I t = number of exterior regions adjacent to the circuit 
J t = number of interior regions adjacent to the circuit 
behaves as the continuous-time Markov chain with transition rates 




(8) 



C 




c 



(hj) (i - 1 J) '■ rate * 
-> - 1) : rate j 
— > destroyed : rate i + j. 
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These rates correspond to the three possibilities that adjacent exterior regions merge; 
or adjacent interior regions merge; or an exterior and interior region merge. The latter 
possibility is the case that the circuit no longer uses only boundary edges of the empire 
process, and so in particular cannot be the boundary of the region containing b. 

Recall we are studying a circuit of length 2n in the hexagonal lattice, and it is 
easy to check such a circuit in this initial lattice has n + 3 exterior and n — 3 interior 
regions, so the initial state of the Markov chain is (Jo, Jo) = {n + 3, n — 3). Strictly, these 
rates only apply when i,j > 2, but allowing "fictitious" transitions 1 — > simplifies the 
analysis (see Discussion below). We can compare the Markov chain (I t , J t ) to the chain 
(/*, J t *) defined in the same way but without the "destroyed" possibility. Because in the 
process (It, Jt) the next transition has chance exactly 1/2 to be "destroyed", we find 

p(i t = i,j t = j) = ? + i- 2n p(i; = i, j* t = j). (9) 

We are interested in P2n(t) = P(Jt = 1 or 0) (the captures the fictitious transition), 
and combining ^ and ^ we see that we need to prove 

n— 3 /.qo 

2- Sn 21 / Pn + 3.n- 3 m = i, J: = J or 0) dt < oo (10) 

i=l J ° 

where we wrote P n+3j „_ 3 (-) as a reminder that the chain (7 t *, J t *) starts at (n + 3, n — 3). 

Now if the chain (7 t *, J t *) ever hits state (i, j) then it remains in that state for mean 
time l/(i + j), so we can write 

POO 

/ Pn+3,n-3(It = h J t = j ° r °) dt = [i + 1) _1 q n +3,n-z{h l ) + 2~V+3,n- 

where g n+3iri _ 3 (i, j) is the chance that the embedded discrete-time jump chain (X S ,Y S ) 
ever hits state But this jump chain is simply the chain obtained by drawing 

without replacement from a box which initially has n + 3 balls labeled "exterior" and 
n — 3 balls labeled "interior", and writing (X s , Y s ) for the number of remaining balls of 
each type after s draws. In particular, by considering reversed order of draws 

9 „ +3 ,„_3 (i ,l)^ i + 1 )^|i±|<C( i + 1 )2-. 

for some constant C, with a smaller similar bound for g n+3 n _ 3 (i, 0). We now see the 
inner sum in (10) is bounded by order n and thus the outer sum is indeed finite. 



3' 



3.1. Discussion of approximations above. 

What is assumed in setting up the rates for the continuous-time Markov chain is that 
the j successive exterior (and similarly for interior) regions touching the circuit C 

(a) are distinct 

(b) meet only if they are consecutive pairs along C 

(c) j > 3. 

Under these assumptions, the combined merger rate is indeed j. If instead of (c) we 
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have j : = 2 then the merger rate is 1, and if j — 1 the rate is 0; however it is not hard 
to see that these changed rates for j = 1,2 don't affect the "finiteness" result 

But the other simplifying assumption are more significant. If in Figure 2 the 
regions W and Y are adjacent, by meeting behind region X, then the combined merger 
rate is larger than j; moreover if they do merge then (a) fails for the merged region. 
The resulting "combinatorial explosion" of possibilities for successive regions along the 
boundary being distinct or identical seems very hard to analyze rigorously. 

4. Simulation results 

We will show simulation results for the following four rate functions. 

Model 1: X(A,B) = 1. 

Model 2: \{A,B) = area(A) x area( J B). 

Model 3: X(A, B) =L(A,B). 

Model 4: X(A, B) = l/(area(A) x area(£)). 

The quantities we calculated were the spatial averages 

S(t) = average (area of empire) 2 per unit area 
D(t) = average number of empires per unit area. 

Note the alternative interpretations: the average area of an empire (for uniform random 
choice of empire) equals 1/D(t), while the average area of the empire containing a 
random point (i.e. the average when empires are chosen wih probability proportional 
to area) equals S(t). We started at t — with the square grid, so 5(0) = -D(O) = 1; 
as t increases, S(t) increases and D(t) decreases. In order to compare different models 
it is convenient to plot the curve (D(t), S(t)), regarded as a function S(D). This is a 
natural way to summarize the non-hegemonic phase. If the model is hegemonic then, 
as D decreases from 1, S(D) will increase to infinity at some critical value D CIit > 0, 
whereas for a non-hegemonic model D crit = 0. 

There is a different way to summarize the hegemonic phase. Restrict the model to 
a large finite region of area A. Then as D decreases from 1 to 0, the quantity S(D)/A 
increases from 1/A to 1, and S(D)/A equals the chance that two random points in 
the region are in the same empire. In the A — > oo limit, the function S(D)/A will 
become a function f{D) analogous to the percolation function in percolation theory, 
with f(D) = for D > D CIit and f(D) > for D < D ctit . 

These two ways to think about D CIit are shown in Figure 3 for the four models. 
The upper graph in each pair shows S(D), and the lower graph shows S(D)/A, for two 
models. The points D CIit are where (in the A —>■ oo limit) the upper graph goes to 
infinity and where the lower graph leaves zero. Data are from simulations on an 81 x 81 
grid. 
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Figure 3. Simulation results for 4 models, explained in text. 
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The data from Figure 3 suggests the following rough numerical estimates, which 
we then compare to theory. The scaling exponents 7 will be discussed below. 
Model 1: [7 = 0]: D czlt m 0.15. This is consistent with the non-rigorous argument for 
hegemony (D crit > 0). 

Model 2: [7 = 2]: D CIit « 0.6. In this case Proposition [l](a) applies, so we know 
D CIit > 0. 

Model 3: [7 = 1/2]: D CIit « 0.2. This is the "bond percolation" case, so we know 
D CIit > 0. 

Model 4: [7 = —2]: D crit rs 0. In this case Proposition [T](b) applies, so we know 
D cilt = 0. 

Figure 4 shows emergence of the giant component in the case A = 1. The visual 
appearance is rather different from that of near-critical bond percolation, in that the 
large components appear less "fractal" and that distinct moderately large components 
coexist over longer time periods. 

Scaling exponents. In the context of mean-field coalescence, a kernel such that 
K(cx, cy) = c r K(x, y) is said to have scaling exponent 7. It has long been understood, 
mostly non-rigorously (but see [11] for references to recent rigorous work) that the 
coalescence process should be non-gelling if 7 < 1 but gelling if 7 > 1. 

We can define a scaling exponent 7 analogously for the empire process: X(cA, cB) = 
c 27 A(A, B), where cA denotes linear scaling by a factor c. The data above is consistent 
with the possibility that there is some critical value 7 crit such that empire processes are 
typically hegemonic for 7 > 7 crit and are typically non-hegemonic for 7 < 7 crit . However, 
the rate function at ^ which is known to be non- hegemonic has 7 = 0, as does the 
A = 1 case presumed to be hegemonic, so it may be that scaling exponents are not so 
definitive for empire processes. 
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